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GenCore version 6.2.1 
Copyright (c) 1993 - 2008 Biocceleration Ltd. 



OM protein - protein search, using sw model 

Run on: June 24, 2008, 15:31:23 ; Search time 345 Seconds 

(without alignments) 
2090.033 Million cell updates/sec 

Title : US-10-552-515-l_COPY_157_933 
Perfect score: 4123 

Seguence: 1 QQDVQDGNTTVHYALLSASW SELSSHWTPFTVPKASQLQQ 777 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 4051641 segs, 928007118 residues 

Total number of hits satisfying chosen parameters: 4051641 

Minimum DB seg length: 0 

Maximum DB seg length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : Published_Applications_AA_Main : * 

/ABSS/Data/CRF/ptodata/l/pubpaa/US07_PUBCOMB.pep: * 
/ABSS/Data/CRF/ptodata/l/pubpaa/US08_PUBCOMB.pep: * 
/ABSS/Data/CRF/ptodata/l/pubpaa/US09_PUBCOMB.pep: * 
/ABSS/Data/CRF/ptodata/l/pubpaa/US10A_PUBCOMB.pep: * 
/ABSS/Data/CRF/ptodata/l/pubpaa/US10B_PUBCOMB.pep: * 
/ABSS/Data/CRF/ptodata/l/pubpaa/USllA_PUBCOMB.pep: * 
/ABSS/Data/CRF/ptodata/l/pubpaa/USllB_PUBCOMB.pep: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or egual to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 
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Result Query 



No . 


Score 


Match 


Length 


DB 


ID 








Description 


1 


4123 


100 . 


. 0 


933 


5 


us- 


-10- 


-552- 


-515-1 


Secjuence 


1 Appli 


2 


3739 


90 , 


, 7 


885 


7 


us- 


-11- 


-599- 


-845A-700 


Secjuence 


7 00, App 


3 


3572 


86 , 


. 6 


843 


7 


us- 


•11- 


-599- 


-845A-698 


Secjuence 


6 98, App 


4 


3031.5 


73 , 


. 5 


898 


5 


us- 


•10- 


-450- 


-763-45847 


Secjuence 


45847, A 


5 


1502.5 


36 , 


, 4 


920 


4 


us- 


•10- 


-104- 


-047-2574 


Secjuence 


25 7 4, Ap 


6 


1502.5 


36 , 


, 4 


920 


6 


us- 


•11- 


-072- 


-512-2574 


Secjuence 


25 7 4, Ap 


7 


1498.5 


36 , 


. 3 


981 


6 


us- 


-11- 


-582- 


-861-10841 


Secjuence 


10841, A 


8 


1471 . 5 


35 , 


, 7 


981 


6 


us- 


-11- 


-443- 


-428A-801153 


Secjuence 


801153, 


9 


1471.5 


35 , 


, 7 


1046 


6 


us- 


-11- 


-582- 


-861-9875 


Secjuence 


98 75, Ap 


10 


1461.5 


35 , 


, 4 


840 


6 


us- 


-11- 


-177- 


-894-11 


Secjuence 


11, Appl 


11 


1456 .5 


35 , 


. 3 


960 


6 


us- 


-11- 


-177- 


-894-7 


Secjuence 




12 


1437 


34 , 


, 9 


999 


6 


us- 


-11- 


-443- 


-428A-774192 


Secjuence 


774192, 


13 


1417 


34 , 


, 4 


800 


6 


us- 


-11- 


-443- 


-428A-739452 


Secjuence 


739452, 


14 


1417 


34 , 


, 4 


800 


6 


us- 


-11- 


-443- 


-428A-739454 


Secjuence 


739454, 


15 


1416 


34 , 


. 3 


853 


6 


us- 


-11- 


-443- 


-428A-739456 


Secjuence 


739456 , 


16 


1378.5 


33 , 


, 4 


1219 


6 


us- 


-11- 


-097- 


-143-15228 


Secjuence 


15228, A 


17 


1369 


33 , 




910 


5 


us- 


-10- 


-484- 


-148-14 


Secjuence 


14, Appl 


18 


1367.5 


33 , 


' 2 


712 


6 


us- 


-11- 


-177- 


-894-10 


Secjuence 


10, Appl 


19 


1352 . 5 


32 , 


. 8 


891 


6 


us- 


-11- 


-582- 


-861-10193 


Secjuence 


10193, A 


20 


1344 


32 , 


. 6 


1075 


6 


us- 


-11- 


-097- 


-143-24771 


Secjuence 


24771, A 


21 


1159 . 5 


28 , 


, i 


1058 


6 


us- 


-11- 


-097- 


-143-21858 


Secjuence 


21858, A 


22 


1154 


28 , 


. 0 


596 


4 


us- 


-10- 


-104- 


-047-2541 


Secjuence 


2541, Ap 


23 


1154 


28 , 


. 0 


596 


6 


us- 


-11- 


-072- 


-512-2541 


Secjuence 


2541, Ap 


24 


1061.5 


25 , 


. 7 


594 


5 


us- 


-10- 


-631- 


-467-681 


Secjuence 


6 81, App 


25 


1061.5 


25 , 


. 7 


594 


5 


us- 


-10- 


-529- 


-348-1242 


Secjuence 


1242, Ap 


26 


1061.5 


25 , 


. 7 


594 


5 


us- 


-10- 


-917- 


-503-10953 


Sequence 


10953, A 


27 


1061.5 


25 


, 7 


594 


6 


us- 


-11- 


-177- 


-894-8 


Secjuence 


8, Appli 


28 


1043.5 


25 


. 3 


825 


6 


us- 


-11- 


-443- 


-428A-1026041 


Secjuence 


1026041, 


29 


1024.5 


24 , 


. 8 


782 


4 


us- 


-10- 


-066- 


-543-1424 


Sequence 


142 4, Ap 


30 


1024.5 


24 , 


. 8 


793 


6 


us- 


-11- 


-443- 


-428A-1026033 


Sequence 


1026033, 


31 


912.5 


22 , 


, l 


475 


4 


us- 


-10- 


-104- 


-047-3116 


Sequence 


3116, Ap 


32 


912.5 


22 , 


. 1 


475 


6 


us- 


-11- 


-072- 


-512-3116 


Sequence 


3116, Ap 


33 


905 


22 , 


. 0 


756 


5 


us- 


-10- 


-692- 


-382-446 


Sequence 


4 46, App 


34 


886 


21 , 


. 5 


573 


6 


us- 


-11- 


-443- 


-428A-774194 


Sequence 


774194, 


35 


881.5 


21 , 


. 4 


550 


6 


us- 


-11- 


-443- 


-428A-774193 


Sequence 


774193, 


36 


873.5 


21 


. 2 


642 


4 


us- 


-10- 


-108- 


-260A-4483 


Sequence 


4 4 83, Ap 


37 


873.5 


21 


. 2 


642 


6 


us- 


-11- 


-177- 


-894-9 


Sequence 


9, Appli 


38 


873.5 


21 , 


. 2 


642 


6 


us- 


-11- 


-293- 


-697-4483 


Sequence 


4 4 83, Ap 


39 


829 


20 


, 1 


631 


6 


us- 


-11- 


-443- 


-428A-739453 


Sequence 


739453, 


40 


819.5 


19 


.9 


443 


4 


us- 


-10- 


-264- 


-049-2917 


Sequence 


2917, Ap 


41 


801.5 


19, 


. 4 


853 


6 


us- 


-11- 


-443- 


-428A-1026043 


Sequence 


1026043, 


42 


784 . 5 


19 


.0 


390 


4 


us- 


-10- 


-264- 


-237-2758 


Sequence 


2758, Ap 


43 


735 


17 


.8 


139 


3 


us- 


-09- 


-957- 


-708-31 


Sequence 


31, Appl 


44 


735 


17 


.8 


139 


6 


us- 


-11- 


-230- 


-251-31 


Sequence 


31, Appl 


45 


691.5 


16, 


.8 


306 


6 


us- 


-11- 


-443- 


-428A-879642 


Sequence 


879642, 



ALIGNMENTS 
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RESULT 1 
US-10-552-515-1 

; Sequence 1, Application US/10552515 
; Publication No. US20060194204A1 
; GENERAL INFORMATION: 

; APPLICANT: The Government of the United States of America as 

; APPLICANT: represented by the Secretary of the Department of Health and 

; APPLICANT: Human Services 

; APPLICANT: Bera, Tapan K. 

; APPLICANT: Pastan, Ira H. 

; APPLICANT: Lee, Byungkook 

; TITLE OF INVENTION: GENE EXPRESSED IN PROSTATE CANCER AND METHODS OF USE 

; FILE REFERENCE: 4239-68223-02 

; CURRENT APPLICATION NUMBER: US/10/552,515 

; CURRENT FILING DATE: 2005-10-06 

; PRIOR APPLICATION NUMBER: PCT/US2004/10588 

; PRIOR FILING DATE: 2004-04-05 

; PRIOR APPLICATION NUMBER: 60/461,399 

; PRIOR FILING DATE: 2003-04-08 

; NUMBER OF SEQ ID NOS : 12 

; SOFTWARE: Patentln version 3.2 

; SEQ ID NO 1 

; LENGTH: 933 

; TYPE: PRT 

; ORGANISM: Artificial Sequence 
; FEATURE : 

; OTHER INFORMATION: Splice Variant-Novel Gene Expressed in Prostate 
US-10-552-515-1 

Query Match 100.0%; Score 4123; DB 5; Length 933; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 777; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVL 6 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 15 7 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVL 216 

Qy 61 LEWPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLL 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 217 LEWPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLL 276 

Qy 121 GIHQLLAE GVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 277 GIHQLLAE GVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 336 

Qy 181 RYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 337 RYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 396 

Qy 241 CLDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSD 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 397 CLDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSD 456 

Qy 301 YEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSWIWMVAVWMCLV 360 
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I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
457 YEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSWIWMVAWVMCLV 516 

361 SIILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEM 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
517 SIILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEM 5 76 

421 HRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGC 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
577 HRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGC 636 

4 81 LIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDY 5 4 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
63 7 LIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDY 6 96 

5 41 ELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVE IRLDARKFVCEYRRPVA 6 00 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
697 ELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVE IRLDARKFVCEYRRPVA 756 

6 01 ERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAP 66 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
75 7 ERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAP 816 

661 SSFAAAHNRTCRYRAFRDDDGHYSQTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIP 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
817 SSFAAAHNRTCRYRAFRDDDGHYSQTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIP 8 76 

721 ESVEIKVKREYYLAKQALAENEVLFGTNGTKDEQPKGSELSSHWTPFTVPKASQLQQ 777 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
877 ESVEIKVKREYYLAKQALAENEVLFGTNGTKDEQPKGSELSSHWTPFTVPKASQLQQ 933 



RESULT 2 

US-11-599-845A-700 

Sequence 700, Application US/11599845A 
Publication No. US20080025981A1 
GENERAL INFORMATION: 

APPLICANT: Young, Paul E. 

APPLICANT: Ebner, Reinhard 

APPLICANT: Weaver, Zoe 

APPLICANT: Strovel, Jeffrey W. 

APPLICANT: Horrigan, Stephen K. 

APPLICANT: Shea, Martin 

APPLICANT: Weigle, Bernd 

APPLICANT: Rieger, Michael 

APPLICANT: Rick, Jennifer A. 

APPLICANT: Cain, Colyn B. 

TITLE OF INVENTION: Cancer-linked Genes as Target for Chemotherapy 
FILE REFERENCE: 689290-273 

CURRENT APPLICATION NUMBER: US/11/599, 845A 
CURRENT FILING DATE: 2006-11-15 
PRIOR APPLICATION NUMBER: 10/585,466 
PRIOR FILING DATE: 2005-01-04 

PRIOR APPLICATION NUMBER: PCT/US2005/000040 
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PRIOR FILING DATE: 2005-01-04 

PRIOR APPLICATION NUMBER: 10/583,832 

PRIOR FILING DATE: 2004-12-16 

PRIOR APPLICATION NUMBER: PCT/US2004/42406 

PRIOR FILING DATE: 2004-12-16 

PRIOR APPLICATION NUMBER: 10/575,337 

PRIOR FILING DATE: 2004-10-07 

PRIOR APPLICATION NUMBER: PCT/US2004/33072 

PRIOR FILING DATE: 2004-10-07 

PRIOR APPLICATION NUMBER: 10/540,310 

PRIOR FILING DATE: 2003-12-19 

PRIOR APPLICATION NUMBER: PCT/US2003/40710 

PRIOR FILING DATE: 2003-12-19 

PRIOR APPLICATION NUMBER: 10/518,039 

PRIOR FILING DATE: 2003-06-10 

PRIOR APPLICATION NUMBER: PCT/US2003/19741 

PRIOR FILING DATE: 2003-06-10 

Remaining Prior Application data removed - See File Wrapper or PALM. 
NUMBER OF SEQ ID NOS : 769 
SOFTWARE: Patentln version 3.0 
SEQ ID NO 700 

LENGTH: 885 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-11-599-845A-700 

Query Match 90.7%; Score 3739; DB 7; Length 885; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 702; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVL 6 0 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 15 8 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVL 217 

Qy 61 LEWPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLL 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 218 LEWPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLL 277 

Qy 121 GIHQLLAE GVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 78 GIHQLLAE GVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 33 7 

Qy 181 RYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 338 RYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 397 

Qy 241 CLDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSD 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 398 CLDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSD 457 

Qy 301 YEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSWIWMVAWVMCLV 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 458 YEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSWIWMVAWVMCLV 517 
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361 SIILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEM 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 11 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
518 SIILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEM 577 

421 HRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGC 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
5 7 8 HRTQTKFEDAFTLKVF IFQFVNF YSSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGC 63 7 

4 81 LIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDY 5 4 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
638 LIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDY 6 9 7 

5 41 ELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVE IRLDARKFVCEYRRPVA 6 00 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

6 98 ELVPCEGLFDEYLEMVLQFGFVT I FVAACPLAPLFALLNNWVE IRLDARKFVCEYRRPVA 75 7 

6 01 ERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAP 66 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
75 8 ERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAP 817 

661 SSFAAAHNRTCRYRAFRDDDGHYSQTYWNLLAIRLAFVIVFE 702 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
818 SSFAAAHNRTCRYRAFRDDDGHYSQTYWNLLAIRLAFVIVFE 85 9 



RESULT 3 

US-11-599-845A-698 

Sequence 698, Application US/11599845A 
Publication No. US20080025981A1 
GENERAL INFORMATION: 

APPLICANT: Young, Paul E. 

APPLICANT: Ebner, Reinhard 

APPLICANT: Weaver, Zoe 

APPLICANT: Strovel, Jeffrey W. 

APPLICANT: Horrigan, Stephen K. 

APPLICANT: Shea, Martin 

APPLICANT: Weigle, Bernd 

APPLICANT: Rieger, Michael 

APPLICANT: Rick, Jennifer A. 

APPLICANT: Cain, Colyn B. 

TITLE OF INVENTION: Cancer-linked Genes as Target for Chemotherapy 
FILE REFERENCE: 689290-273 

CURRENT APPLICATION NUMBER: US / 1 1 / 5 9 9 , 8 45A 
CURRENT FILING DATE: 2006-11-15 
PRIOR APPLICATION NUMBER: 10/585,466 
PRIOR FILING DATE: 2005-01-04 

PRIOR APPLICATION NUMBER: PCT/US2005/000040 

PRIOR FILING DATE: 2005-01-04 

PRIOR APPLICATION NUMBER: 10/583,832 

PRIOR FILING DATE: 2004-12-16 

PRIOR APPLICATION NUMBER: PCT/US2004/42406 

PRIOR FILING DATE: 2004-12-16 

PRIOR APPLICATION NUMBER: 10/575,337 

PRIOR FILING DATE: 2004-10-07 
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; PRIOR APPLICATION NUMBER: PCT/US2004/33072 

; PRIOR FILING DATE: 2004-10-07 

; PRIOR APPLICATION NUMBER: 10/540,310 

; PRIOR FILING DATE: 2003-12-19 

; PRIOR APPLICATION NUMBER: PCT/US2003/40710 

; PRIOR FILING DATE: 2003-12-19 

; PRIOR APPLICATION NUMBER: 10/518,039 

; PRIOR FILING DATE: 2003-06-10 

; PRIOR APPLICATION NUMBER: PCT/US2003/19741 

; PRIOR FILING DATE: 2003-06-10 

; Remaining Prior Application data removed - See File Wrapper or PALM. 

; NUMBER OF SEQ ID NOS : 769 

; SOFTWARE: Patentln version 3.0 

; SEQ ID NO 698 

; LENGTH: 843 

; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-11-599-845A-698 



Query Match 86.6%; Score 3572; DB 7; Length 843; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 6 71; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



1 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVL 6 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
158 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVL 217 



61 LEWPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLL 120 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
218 LEWPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLL 2 7 7 



121 GIHQLLAE GVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2 78 GIHQLLAE GVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 33 7 



181 RYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 2 4 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
338 RYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 39 7 



2 41 CLDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSD 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
398 CLDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSD 45 7 



301 YEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSWIWMVAWVMCLV 36 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
45 8 YEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSWIWMVAWVMCLV 517 



361 SI ILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEM 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
518 SI ILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEM 577 



421 HRTQTKFEDAFTLKVFIFQF WFYSSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGC 4 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
5 7 8 HRTQTKFEDAFTLKVF IFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGC 63 7 
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Qy 



4 81 LIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDY 5 4 0 




Db 



638 LIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDY 6 9 7 



Qy 



5 41 ELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVA 6 00 




Db 



6 98 ELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVA 75 7 



Qy 



6 01 ERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAP 66 0 




Db 



75 8 ERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAP 817 



Qy 



661 SSFAAAHNRTC 6 71 



Db 



818 SSFAAAHNRTC 828 



RESULT 4 

US-10-450-763-45847 

; Sequence 45847, Application US/10450763 

; Publication No. US20050196 754A1 

; GENERAL INFORMATION: 

; APPLICANT: Hyseq, Inc 

; TITLE OF INVENTION: NOVEL NUCLEIC ACIDS AND POLYPEPTIDES 

; FILE REFERENCE: 790CIP3/US 

; CURRENT APPLICATION NUMBER: US/10/450,763 

; CURRENT FILING DATE: 2003-06-11 

; PRIOR APPLICATION NUMBER: PCT/US01/08631 

; PRIOR FILING DATE: 2001-03-30 

; PRIOR APPLICATION NUMBER: 09/540,217 

; PRIOR FILING DATE: 2000-03-31 

; PRIOR APPLICATION NUMBER: 09/649,167 

; PRIOR FILING DATE: 2000-08-23 

; NUMBER OF SEQ ID NOS : 60736 

; SOFTWARE: Custom 

; SEQ ID NO 45847 

; LENGTH: 898 

; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-10-450-763-45847 

Query Match 73.5%; Score 3031.5; DB 5; Length 898; 

Best Local Similarity 91.2%; Pred. No. 4.6e-291; 

Matches 578; Conservative 2; Mismatches 11; Indels 43; Gaps 2; 

Qy 1 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVL 6 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I : I I I I I I I I I 

Db 25 0 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQDYPTRPPTGRPACCAWLGIPNVL 309 

Qy 61 LEWPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLL 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 310 LEWPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLL 369 
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121 GIHQLLAE GVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 180 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

3 70 GIHQLLAE GVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 42 9 

181 RYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 2 4 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
430 RYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 489 

2 41 CLDCPFWLLSSACALAQ AGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRW 2 96 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

4 90 CLDCPFWLLSSACALAQVREEAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRW 5 4 9 

297 DCSDYEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSWIWMVAWV 356 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
55 0 DCSDYEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSWIWMVAWV 6 09 

35 7 MCLVSIILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLT 416 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
610 MCLVSIILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLT 66 9 

417 RWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGVRNEECA 4 76 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

6 70 RWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGVRNEECA 72 9 

477 AGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPW 536 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

730 AGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPW 7 89 

53 7 EDDYELVPCEGLFDEYLEM VL 55 7 

I I I I I I I I I I I I I I I I I I I II 

7 90 EDDYELVPCEGLFDEYLEMGAGFCPNACPELVPELTEPEKARDQPEARSAGQDSRPEAVL 84 9 

558 QFGFVTIFVAACPLAPLFALLNNWVEIRLDARKF 591 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I 
850 QFGFVTIFVAACPLAPLFALLNNWVEIRLDARKF 883 



RESULT 5 

US-10-104-047-2574 

; Sequence 2574, Application US/10104047 

; Publication No. US20030236392A1 

; GENERAL INFORMATION: 

; APPLICANT: HELIX RESEARCH INSTITUTE 

; TITLE OF INVENTION: No. US20030236392Alel full length cDNA 

; FILE REFERENCE: H1-A0105 

; CURRENT APPLICATION NUMBER: US/ 1 0 / 1 0 4 , 04 7 

; CURRENT FILING DATE: 2002-03-25 

; PRIOR APPLICATION NUMBER: 

; PRIOR FILING DATE: 

; NUMBER OF SEQ ID NOS : 4096 

; SOFTWARE: Patentln Ver . 2.1 

; SEQ ID NO 2574 

; LENGTH: 920 

; TYPE: PRT 
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; ORGANISM: Homo sapiens 
US-10-104-047-2574 

Query Match 36.4%; Score 1502.5; DB 4; Length 920; 

Best Local Similarity 40.4%; Pred. No. 4.5e-139; 

Matches 328; Conservative 145; Mismatches 270; Indels 69; Gaps 20; 

Qy 8 NTTVHYALLSASWAVLCYYAEDLRLKLPLQE LPNQASNWS AGLLAWLGIP 5 7 

I : : : I I I I I I I I : : : : I : I I : I : I II 

Db 122 NSDI IFVKLHAPWEVLGRYAEQMNVRMPFRRKIYYLPRRYKFMSRIDKQISRLRRWLPKK 181 

Qy 58 NVLL — EWPDVPP-EYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGH 114 

: I I : I I : : I : I : : I : I : : I I I : I : I : I I : I 
Db 182 PMRLDKETLPDLEENDCYTAPFSQQRIHHFI— IHNKETFFNNATRSRIVHHILQRIKY-E 239 

Qy 115 EKKNLLGIHQLLAE GVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQ 174 

I I I : I : : : I I I I I I I I I : I : : : I I : I : : I I I I I I I I 

Db 240 EGKNKIGLNRLLTNGSYEAAFPLHEGSYRSKNSIRTHGAENHRHLLYECWASWGVWYKYQ 299 

Qy 175 PLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDS 234 

I I I I I I I I I I I : I I I I I I I : I I I I I i I : I I I I I : : : I : I : I 

Db 300 PLDLVRRYFGEKIGLYFAWLGWYTGMLFPAAFIGLFVFLYGVTTLDHSQVSKEVCQATDI 359 

Qy 235 FEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLA 293 

I I I : I III I I : I I : I I I : I I I I I : : I I I : I I : I I : I I I : I : I 
Db 360 I-MCPVCDKYCPFMRLSDSCVYAKVTHLFDNGATVFFAVFMAVWATVFLEFWKRRRAVIA 418 

Qy 294 YRWDCSDYEDTEERPRPQFAAS-APMTAPNPITGEDEPYFPERSRARRMLAGSWIWMV 352 

III I : I : I I I I I I I : I I I : I : I I I : I : : : I I : 

Db 419 YDWDLIDWEEEEEEIRPQFEAKYSKKERMNPISGKPEPYQAFTDKCSRLIVSASGIFFMI 4 78 

Qy 353 AWVMCLVSIILYRAIMAIWSRSGNTLLA-AWA SRIASLTGSW — NLVFILIL 404 

II: : I : : I I : : I I II I : : I : I I : I I I : : I 

Db 479 CWIAAVFGIVIYRWTV STFAAFKWALIRNNSQVAT-TGTAVCINFCI IMLL 530 

Qy 405 SKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNY 464 

: : I : I : I I I I I : : : : I : : I I I I : I : I I I I I II I I I I I III I : I I I 
Db 531 NVLYEKVALLLTNLEQPRTESEWENSFTLKMFLFQFVNLNSSTFYIAFFLGRFTGHPGAY 590 

Qy 465 HTLFG-VRNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKR 523 

I I I I I I I I I : I : : : I I I I I II I : I : : I I : I : : 
Db 591 LRLINRWRLEECHPSGCLIDLCMQMGIIMVLKQTWNNFMELGYPLIQNWWTR RKVRQ 647 

Qy 524 KAGASAGASQGPWEDDYELVPCE — GLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNW 581 

: I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I 

Db 648 EHGPERKISFPQWEKDYNLQPMNAYGLFDEYLEMILQFGFTTIFVAAFPLAPLLALLNNI 707 

Qy 582 VEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYR 641 

: I I I I I I III : : I I I : I I I : I I I I I : II I : I : I I : I I I : : I : I I I : I I I 
Db 708 IEIRLDAYKFVTQWRRPLASRAKDIGIWYGILEGIGILSVITNAFVIAITSDFIPRLVYA 76 7 

Qy 6 42 W TRAHDLRGFLNFTLA RAPSSFAAAHNRTCRYRAFR 6 77 

: : I : : I : I : I I : : I I I I : I 

Db 76 8 YKYGPCAGQGEAGQKCMVGYVNASLSVFRISDFENRSEPESDGSEFSGTPLKYCRYRDYR 82 7 
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Qy 



6 78 DDDGH YSQTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYY 732 



Db 



82 8 DPPHSLVPYGYTLQFWHVLAARLAFIIVFEHLVFCIKHLISYLIPDLPKDLRDRMRREKY 88 7 



Qy 



733 LAKQALAENEVLFGTNGTKDEQPKGSELSSHW 764 



Db 



888 LIQEMMYEAELERLQKERKERKKNGKAHHNEW 919 



RESULT 6 

US-11-072-512-2574 

; Sequence 2574, Application US/11072512 

; Publication No. US20060029945A1 

; GENERAL INFORMATION: 

; APPLICANT: ISOGAI, TAKAO 

; APPLICANT: SUGIYAMA, TOMOYASU 

; APPLICANT: OTSUKI, TETSUJI 

; APPLICANT: WAKAMATSU, AI 

; APPLICANT: SATO, HIROYUKI 

; APPLICANT: ISHII, SHIZUKO 

; APPLICANT: YAMAMOTO, JUN-ICHI 

; APPLICANT: ISONO, YUUKO 

; APPLICANT: HIO, YURI 

; APPLICANT: OTSUKA, KAORU 

; APPLICANT: NAG A I , KEIICHI 

; APPLICANT: IRIE, RYOTARO 

; APPLICANT: TAMECHIKA, ICHIRO 

; APPLICANT: SEKI, NAOHIKO 

; APPLICANT: YOSHIKAWA, TSUTOMU 

; APPLICANT: OTSUKA, MOTOYUKI 

; APPLICANT: NAGAHARI, KENJI 

; APPLICANT: MASUHO, YASUHIKO 

; TITLE OF INVENTION: Novel full length cDNA 

; FILE REFERENCE: 084335-0191 

; CURRENT APPLICATION NUMBER: US / 1 1 / 0 72 , 512 

; CURRENT FILING DATE: 2005-03-07 

; PRIOR APPLICATION NUMBER: US 60/350,978 

; PRIOR FILING DATE: 2002-01-25 

; PRIOR APPLICATION NUMBER: JP 2001-379298 

; PRIOR FILING DATE: 2001-11-05 

; NUMBER OF SEQ ID NOS : 4096 

; SOFTWARE: Patentln Ver . 2.1 

; SEQ ID NO 2574 

; LENGTH: 920 

; TYPE : PRT 

; ORGANISM: Homo sapiens 
US-11-072-512-2574 

Query Match 36.4%; Score 1502.5; DB 6; Length 920; 

Best Local Similarity 40.4%; Pred. No. 4.5e-139; 

Matches 328; Conservative 145; Mismatches 270; Indels 69; Gaps 20; 
Qy 8 NTTVHYALLSASWAVLCYYAEDLRLKLPLQE LPNQASNWS AGLLAWLGIP 5 7 
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I : : : I I I I I I I I : : : : I : II: I : I II 

122 NSDI IFVKLHAPWEVLGRYAEQMNVRMPFRRKIYYLPRRYKFMSRIDKQISRLRRWLPKK 181 

5 8 NVLL — EWPDVPP-EYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGH 114 
: I I : I I : : I : I : : I : I : : I I I : I : I : I I : I 
182 PMRLDKETLPDLEENDCYTAPFSQQRIHHFI— IHNKETFFNNATRSRIVHHILQRIKY-E 239 

115 EKKNLLGIHQLLAE GVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQ 17 4 

I I I : I : : : I I I I I I I I I : I : : : I I : I : : I I I I I I I I 

2 4 0 EGKNKIGLNRLLTNGSYEAAFPLHEGSYRSKNSIRTHGAENHRHLLYECWASWGVWYKYQ 2 99 

175 PLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDS 234 

I I I I I I I I I I I : I I I I I I I : I I I I I I I : I I I I I : : : I : I : I 

300 PLDLVRRYFGEKIGLYFAWLGWYTGMLFPAAFIGLFVFLYGVTTLDHSQVSKEVCQATDI 35 9 

235 FEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLA 2 93 

I I I : I III I I : I I : I I I : I I I I I : : I I I : I I : I I : I I I : I : I 
36 0 I-MCPVCDKYCPFMRLSDSCVYAKVTHLFDNGATVFFAVFMAVWATVFLEFWKRRRAVIA 418 

294 YRWDCSDYEDTEERPRPQFAAS-APMTAPNPITGEDEPYFPERSRARRMLAGSWIWMV 352 

III I : I : I I I I I I I : I I I : I : I I I : I : : : I I : 

419 YDWDLIDWEEEEEEIRPQFEAKYSKKERMNPISGKPEPYQAFTDKCSRLIVSASGIFFMI 4 78 

353 AWVMCLVSI ILYRAIMAIWSRSGNTLLA-AWA SRIASLTGSW — NLVFILIL 404 

II: : I : : I I : : I I II I : : I : I I : I I I : : I 

479 CWIAAVFGIVIYRWTV STFAAFKWALIRNNSQVAT-TGTAVCINFCI IMLL 530 

4 05 SKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNY 46 4 

: : I : I : I I I I I : : : : I : : I I I I : I : I I I I I II I I I I I III I : I I I 
531 NVLYEKVALLLTNLEQPRTESEWENSFTLKMFLFQFVNLNSSTFYIAFFLGRFTGHPGAY 590 

465 HTLFG-VRNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKR 523 

I I I I I I I I I : I : : : I I I I I II I : I : : I I : I : : 
591 LRLINRWRLEECHPSGCLIDLCMQMGIIMVLKQTWNNFMELGYPLIQNWWTR RKVRQ 647 

52 4 KAGASAGASQGPWEDDYELVPCE — GLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNW 5 81 

: I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I 

6 48 EHGPERKISFPQWEKDYNLQPMNAYGLFDEYLEMILQFGFTTIFVAAFPLAPLLALLNNI 70 7 

5 82 VEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYR 6 41 

: I I I I I I III : : I I I : I I I : I I I I I : II I : I : I I : I I I : : I : I I I : I I I 
708 IEIRLDAYKFVTQWRRPLASRAKDIGIWYGILEGIGILSVITNAFVIAITSDFIPRLVYA 76 7 

6 42 W TRAHDLRGFLNFTLA RAPSSFAAAHNRTCRYRAFR 6 77 

: : | : : | : | : I I : : I I I I : I 

76 8 YKYGPCAGQGEAGQKCMVGYVNASLSVFRISDFENRSEPESDGSEFSGTPLKYCRYRDYR 82 7 

6 78 DDDGH YSQTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYY 732 

I I : : I : : I I I I I I : I I I M : I I : I : I : I I : I : : : : : I I I 

82 8 DPPHSLVPYGYTLQFWHVLAARLAFIIVFEHLVFCIKHLISYLIPDLPKDLRDRMRREKY 88 7 

733 LAKQALAENEVLFGTNGTKDEQPKGSELSSHW 764 

I : : : I I : I : : I : I 

888 LIQEMMYEAELERLQKERKERKKNGKAHHNEW 919 
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RESULT 7 

US-1 1-5 82-86 1-10841 

; Sequence 10841, Application US/11582861 

; Publication No. US20070099251A1 

; GENERAL INFORMATION: 

; APPLICANT: Zhang, Hui 

; APPLICANT: Aebersold, Rudolf H. 

; TITLE OF INVENTION: TISSUE- AND SERUM-DERIVED GLYCOPROTEINS 

; TITLE OF INVENTION: AND METHODS OF THEIR USE 

; FILE REFERENCE: 460092.404 

; CURRENT APPLICATION NUMBER: US/11/582,861 

; CURRENT FILING DATE: 2006-10-17 

; PRIOR APPLICATION NUMBER: US 60/728,044 

; PRIOR FILING DATE: 2005-10-17 

; NUMBER OF SEQ ID NOS : 14918 

; SOFTWARE: FastSEQ for Windows Version 4.0 

; SEQ ID NO 10841 

; LENGTH: 981 

; TYPE : PRT 

; ORGANISM: Homo sapiens 
US-11-582-861-10841 



Query Match 36.3%; Score 1498.5; DB 6; Length 981; 

Best Local Similarity 40.3%; Pred. No. 1.2e-138; 

Matches 327; Conservative 145; Mismatches 271; Indels 69; Gaps 20; 

y 8 NTTVHYALLSASWAVLCYYAEDLRLKLPLQE LPNQASNWS AGLLAWLGIP 5 7 

I : : : I I I I I I I I : : : : I : I I : I : II 

b 183 NSDI IFVKLHAPWEVLGRYAEQMNVRMPFRRKIYYLPRRYKFMSRIDKQISRFRRWLPKK 242 

y 58 NVLL — EWPDVPP-EYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGH 114 

: I I : I I : : I : I : : I : I : : I I I : I : I : II: I 

b 243 PMRLDKETLPDLEENDCYTAPFSQQRIHHFI-IHNKETFFNNATRSRIVHHILQRIKY-E 300 



Qy 115 EKKNLLGIHQLLAE GVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQ 174 

I I I : I : : : I I I I I I I I I : I : : : I I : I : : I I I I I I I I 

Db 301 EGKNKIGLNRLLTNGSYEAAFPLHEGSYRSKNSIRTHGAENHRHLLYECWASWGVWYKYQ 360 



Qy 175 PLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDS 234 

I I I I I I I I I I I : I I I I I I I : I I I I I I I : I I I I I : : : I : I : I 

Db 361 PLDLVRRYFGEKIGLYFAWLGWYTGMLFPAAFIGLFVFLYGVTTLDHSQVSKEVCQATDI 420 



Qy 235 FEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLA 293 

I I I : I III I I : I I : I I I : I I I I I : : I I I : I I : I I : I I I : I : I 
Db 421 I-MCPVCDKYCPFMRLSDSCVYAKVTHLFDNGATVFFAVFMAVWATVFLEFWKRRRAVIA 479 



Qy 294 YRWDCSDYEDTEERPRPQFAAS— APMTAPNPITGEDEPYFPERSRARRMLAGSWIWMV 352 

III I : I : I I I I I I I : I I I : I : I I I : I : : : I I : 

Db 480 YDWDLIDWEEEEEEIRPQFEAKYSKKERMNPISGKPEPYQAFTDKCSRLIVSASGIFFMI 539 

Qy 353 AVWMCLVSI ILYRAIMAIWSRSGNTLLA— AWA SRIASLTGSW — NLVFILIL 404 

II: : I : : I I : : I I II I : : I : I I : I I I : : I 
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Db 540 CWIAAVFGIVIYRWTV STFAAFKWALIRNNSQVAT-TGTAVCINFCI IMLL 591 



Qy 405 SKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNY 464 

: : I : I : I I I I I : : : : I : : I I I I : I : I I I I I II I I I I I III I : I I I 
Db 592 NVLYEKVALLLTNLEQPRTESEWENSFTLKMFLFQFVNLNSSTFYIAFFLGRFTGHPGAY 651 

Qy 465 HTLFG-VRNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKR 523 

I I I I I I I I I : I : : : I I I I I II I : I : : I I : I : : 
Db 652 LRLINRWRLEECHPSGCLIDLCMQMGIIMVLKQTWNNFMELGYPLIQNWWTR RKVRQ 708 

Qy 524 KAGASAGASQGPWEDDYELVPCE — GLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNW 581 

: I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I 

Db 709 EHGPERKISFPQWEKDYNLQPMNAYGLFDEYLEMILQFGFTTIFVAAFPLAPLLALLNNI 768 

Qy 582 VEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYR 641 

: I I I I I I III : : I I I : I I I : I I I I I : II I : I : I I : I I I : : I : I I I : I I I 
Db 769 IEIRLDAYKFVTQWRRPLASRAKDIGIWYGILEGIGILSVITNAFVIAITSDFIPRLVYA 828 

Qy 6 42 W TRAHDLRGFLNFTLA RAPSSFAAAHNRTCRYRAFR 6 77 

: : I : : I : I : I I : : I I I I : I 

Db 829 YKYGPCAGQGEAGQKCMVGYVNASLSVFRISDFENRSEPESDGSEFSGTPLKYCRYRDYR 888 

Qy 6 78 DDDGH YSQTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYY 732 

I I : : I : : I I I I I I : I I I I I : I I : I : I : I I : I : : : : : I I I 

Db 889 DPPHSLVPYGYTLQFWHVLAARLAFIIVFEHLVFCIKHLISYLIPDLPKDLRDRMRREKY 948 

Qy 733 LAKQALAENEVLFGTNGTKDEQPKGSELSSHW 764 

I : : : I I : I : : I : I 

Db 949 LIQEMMYEAELERLQKERKERKKNGKAHHNEW 980 



RESULT 8 

US-11-443-428A-801153 

; Sequence 801153, Application US/11443428A 

; Publication No. US20070083334A1 

; GENERAL INFORMATION: 

; APPLICANT: Mintz, Liat 

; APPLICANT: Xie, Hanqing 

; APPLICANT: Dahari, Dvir 

; APPLICANT: Levanon, Erez 

; APPLICANT: Freilich, Shiri 

; APPLICANT: Beck, Nili 

; APPLICANT: Zhu, Wei-Yong 

; APPLICANT: Wasserman, Alon 

; APPLICANT: Hermesh, Chen 

; APPLICANT: Azar, Idit 

; APPLICANT: Bernstein, Jeanne 

; TITLE OF INVENTION: METHODS AND SYSTEMS USEFUL FOR ANNOTATING BIOMOLECULAR SEQUENCES 

; FILE REFERENCE: 02/23929 

; CURRENT APPLICATION NUMBER: US/11/443, 428A 

; CURRENT FILING DATE: 2006-05-31 

; NUMBER OF SEQ ID NOS : 1034312 

; SOFTWARE: Patentln version 3.1 

; SEQ ID NO 801153 
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; LENGTH: 981 
; TYPE : PRT 

; ORGANISM: Homo sapiens 
US-1 1-4 43-42 8A-8 01 153 

Query Match 35.7%; Score 1471.5; DB 6; Length 981; 

Best Local Similarity 41.2%; Pred. No. 5.9e-136; 

Matches 318; Conservative 142; Mismatches 241; Indels 71; Gaps 22; 

Qy 2 0 WAVLCYYAEDLRLKLP LQELPNQASNWSAGLLAWLGIPNVL-L 61 

I I I I I I I : : : I : I : I I I III 
Db 214 WDTLCKYAERLNIRMPFRKKCYYTDGRSKSMGRMQTYFRRIKNWMA QNPMVLDK 26 7 

Qy 62 EWPDV-PPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPY — GHEKKN 118 

II: : I : I : : I : : I : I I I I : : I : I : : : I : I I I I 
Db 268 SAFPDLEESDCYTGPFSRARIHHFI-INNKDTFFSNATRSRIVYHMLERTKYENGISK — 324 

119 LLGIHQLLAE GVLSAAFPLHDGPFKT PPEGPQAPRLNQRQVLFQHWARWGKWNKYQ 17 4 

: I I : I : I I I I I I : I : I : III I I : I : : I I I I I I I : I 

325 -VGIRKLINNGSYIAAFPPHEGAYKSSQPIKTHGPQ NNRHLLYERWARWGMWYKHQ 3 79 

175 PLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDS 234 

I I I : I I I I I I : I I I I I I I : I I I I : I ! I : I I II II:: : I I : I : : 
380 PLDLIRLYFGEKIGLYFAWLGWYTGMLIPAAIVGLCVFFYGLFTMNNSQVSQEICKATEV 439 

235 FEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLA 293 

I I I I I I : I I : : I I : I I I : I I I I I I : : I I I : I I : I I : I I I : : I 

4 4 0 F-MCPLCDKNCSLQRLNDSCI YAKVTYLFDNGGTVFFAIFMAIWATVFLEFWKRRRS ILT 4 98 

294 YRWDCSDYEDTEERPRPQFAAS-APMTAPNPITGEDEPYFPERSRARRMLAGSWIWMV 352 

III : : I : I I I I I I I I I I I I I : I I : I : hi I I : 

499 YTWDLIEWEEEEETLRPQFEAKYYKMEIVNPITGKPEPHQPSSDKVTRLLVSVSGIFFMI 55 8 

353 AVWMCLVSIILYR-AIMAIWSRSGNTLLAAWASRIASLTGSV-VNLVFILILSKIYVS 410 

: : I : : : : : I I : I I I : I : I : : I : I : I : : I : I 
559 S L VI T AVF GVWYRL WME QF AS F KWNF I KQ YW — QFATSAAAVCINF 1 1 IMLLNLAYEK 616 

411 LAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFG- 46 9 

: I : : I I I I I : : : : I : : I I I : I : I I I I I II I I I I I I I I I I : I I I : II 
617 IAYLLTNLEYPRTESEWENSFALKMFLFQFVNLNSSIFYIAFFLGRFVGHPGKYNKLFDR 6 76 

4 70 VRNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASA 52 9 

I I I I I I I I : I : : I I I I I : I I I : I : : I I : : : II 
6 77 WRLEECHPSGCLIDLCLQMGVIMFLKQIWNNFMELGYPLIQNWWSRHKI KRGIH- 730 

530 GASQGPWEDDYELVP — CEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLD 58 7 

II I I : I : I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I 
731 DASIPQWENDWNLQPMNLHGLMDEYLEMVLQFGFTTIFVAAFPLAPLLALLNNI IEIRLD 790 

588 ARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRW 6 42 

I I I I : : I I I : I I I I I I I III: I I I I : I I I : : I : I I : : I I I : 
791 AYKFVTQWRRPLPARATDIGIWLGILEGIGILAVITNAFVIAITSDYIPRFVYEYKYGPC 85 0 

6 43 TRAHDLRGFLNFTLARAP— SSFAAAHNRTCRYRAFR DDDGHYSQTYWNLL 6 91 
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: I : I : : I : I : I : I I I I : I : : I I : : I 

Db 851 ANHVEPSENCLKGYVNNSLSFFDLSELGMGKSGYCRYRDYRGPPWSSKPYEFTLQYWHIL 910 

Qy 692 AIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYLAKQALAENEV 743 

I I I I I : I I I I I : I I : : I : I I : I : : : : : I I I I : : : I I : 
Db 911 AARLAFI IVFEHLVFGIKSFIAYLIPDVPKGLHDRIRREKYLVQEMMYEAEL 962 



RESULT 9 

US-11-582-861-9875 

; Sequence 9875, Application US/11582861 

; Publication No. US20070099251A1 

; GENERAL INFORMATION: 

; APPLICANT: Zhang, Hui 

; APPLICANT: Aebersold, Rudolf H. 

; TITLE OF INVENTION: TISSUE- AND SERUM-DERIVED GLYCOPROTEINS 

; TITLE OF INVENTION: AND METHODS OF THEIR USE 

; FILE REFERENCE: 460092.404 

; CURRENT APPLICATION NUMBER: US/11/582,861 

; CURRENT FILING DATE: 2006-10-17 

; PRIOR APPLICATION NUMBER: US 60/728,044 

; PRIOR FILING DATE: 2005-10-17 

; NUMBER OF SEQ ID NOS : 14918 

; SOFTWARE: FastSEQ for Windows Version 4.0 

; SEQ ID NO 9875 

; LENGTH: 1046 

; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-11-582-861-9875 



Query Match 35.7%; Score 1471.5; DB 6; Length 1046; 

Best Local Similarity 41.2%; Pred. No. 6.5e-136; 

Matches 318; Conservative 142; Mismatches 241; Indels 71; Gaps 22; 

Qy 2 0 WAVLCYYAEDLRLKLP LQELPNQASNWSAGLLAWLGIPNVL-L 61 

II::: : I : I I I III 
Db 2 79 WDTLCKYAERLNIRMPFRKKCYYTDGRSKSMGRMQTYFRRIKNWMA QNPMVLDK 332 



Qy 62 EWPDV-PPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPY — GHEKKN 118 

II: : I : I : : I : : I : I I I I : : I : I : : : I : I I I I 
Db 333 SAFPDLEESDCYTGPFSRARIHHFI-INNKDTFFSNATRSRIVYHMLERTKYENGISK — 389 



Qy 119 LLGIHQLLAE GVLSAAFPLHDGPFKT PPEGPQAPRLNQRQVLFQHWARWGKWNKYQ 174 

: I I : I : I I I I I I : I : I : III I I : I : : I I I I I I I : I 

Db 390 -VGIRKLINNGSYIAAFPPHEGAYKSSQPIKTHGPQ NNRHLLYERWARWGMWYKHQ 444 



Qy 175 PLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDS 234 

I I I : I I I I I I : I I I I I I I : I I I I : I I I : I I II I I : : : I I : I : : 
Db 445 PLDLIRLYFGEKIGLYFAWLGWYTGMLIPAAIVGLCVFFYGLFTMNNSQVSQEICKATEV 504 



Qy 235 FEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLA 2 93 

I I I I I I : I I : : I I : I I I : I I I I I I : : I I I : I I : I I : I I I : : I 

Db 505 F-MCPLCDKNCSLQRLNDSCIYAKVTYLFDNGGTVFFAIFMAIWATVFLEFWKRRRSILT 563 
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294 YRWDCSDYEDTEERPRPQFAAS— APMTAPNPITGEDEPYFPERSRARRMLAGSVVIVVMV 352 

III : : I : I I I I I I I I I I I I I : I I : I : I : I I I : 

56 4 YTWDLIEWEEEEETLRPQFEAKYYKMEIVNPITGKPEPHQPSSDKVTRLLVSVSGIFFMI 623 

353 AWVMCLVSIILYR-AIMAIWSRSGNTLLAAWASRIASLTGSV-VNLVFILILSKIYVS 410 

: : I : : : : : I I : I I I : I : I : : I : I : I : : I : I 

624 S L V I T AVF GVWYRLWME QF AS F KWNF I KQ YW — QFATSAAAVCINF 1 1 IMLLNLAYEK 681 

411 LAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFG- 46 9 
: I : : I I I I I : : : : I : : I I I : I : I I I I I II I I I I I I I I I I : I I I : II 

6 82 IAYLLTNLEYPRTESEWENSFALKMFLFQFVNLNSSIFYIAFFLGRFVGHPGKYNKLFDR 741 

4 70 VRNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASA 52 9 

I I I I I I I I : I : : I I I I I : I I I : I : : I I : : : II 

7 42 WRLEECHPSGCLIDLCLQMGVIMFLKQIWNNFMELGYPLIQNWWSRHKI KRGIH- 7 95 

530 GASQGPWEDDYELVP — CEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLD 58 7 

II I I : I : I I II I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I : I I I I I 

7 96 DASIPQWENDWNLQPMNLHGLMDEYLEMVLQFGFTTIFVAAFPLAPLLALLNNI IEIRLD 855 

588 ARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRW 6 42 

I I I I : : I I I : I I I I I I I III: I I I I : I I I : : I : I I : : I I I : 
856 AYKFVTQWRRPLPARATDIGIWLGILEGIGILAVITNAFVIAITSDYIPRFVYEYKYGPC 915 

6 43 TRAHDLRGFLNFTLARAP-SSFAAAHNRTCRYRAFR DDDGHYSQTYWNLL 6 91 

: I : I : : I : I : I : III: : : I I : : I 

916 ANHVEPSENCLKGYVNNSLSFFDLSELGMGKSGYCRYRDYRGPPWSSKPYEFTLQYWHIL 9 75 

692 AIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYLAKQALAENEV 743 

I I I I I : I I I I I : I I : : I : I I : I : : : : : I I I I : : : I I : 
976 AARLAFIIVFEHLVFGIKSFIAYLIPDVPKGLHDRIRREKYLVQEMMYEAEL 1027 



RESULT 10 
US-11-177-894-11 

; Sequence 11, Application US/11177894 

; Publication No. US20060040292A1 

; GENERAL INFORMATION: 

; APPLICANT: West, et al . 

; TITLE OF INVENTION: Tumor Markers and Uses Thereof 

; FILE REFERENCE: 2002850-0048 

; CURRENT APPLICATION NUMBER: US/ 1 1 / 1 7 7 , 8 94 

; CURRENT FILING DATE: 2005-07-08 

; NUMBER OF SEQ ID NOS : 29 

; SOFTWARE: Patentln version 3.2 

; SEQ ID NO 11 

; LENGTH: 840 

; TYPE: PRT 

; ORGANISM: Artificial 

; FEATURE : 

; OTHER INFORMATION: Homo sapiens 
US-11-177-894-11 

Query Match 35.4%; Score 1461.5; DB 6; Length 840; 
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Best Local Similarity 40.1%; Pred. No. 4.7e-135; 

Matches 331; Conservative 149; Mismatches 260; Indels 85; Gaps 21; 

Qy 5 QDGNTTVH YALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLL 61 

: I : I : I : : I I I I I I I I : I I : I : : : : I : I I I I : I I 

Db 28 RDEDTKIHGVGFVKIHAPWNVLCREAEFLKLKMPTKKMYH — INETRGLLK — KINSVLQ 83 

Qy 62 EWPDVPPEYYSCR FRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTP 111 

: : : I : I I I I I I : I : I I I I I : : I I I : I 

Db 84 KITDPIQPKVAEHRPQTMKRLSYPFSREKQHLFDLSD-KDSFFDSKTRSTIVYEILKRTT 142 

Qy 112 YGHEKKNLLGIHQLLAE GVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWN 171 

I : : I I I I I I I : I I : I I I I I : : I I : : I : : I I I : I : 

Db 143 CTKAKYS-MGITSLLANGVYAAAYPLHDGDY NGENVEFNDRKLLYEEWARYGVFY 196 

Qy 172 KYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGS 231 

I I I I : I I I : I I I I I : I I I I I I I II I : I I : : I I : I I I II : : I I : I : I 
Db 197 KYQPIDLVRKYFGEKIGLYFAWLGVYTQMLIPASIVGI IVFLYGCATMDENIPSMEMCDQ 256 

Qy 232 KDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSA 290 

: : I I I I I I : I : I I I I I I : I III: I I I I I : I I I I I I : I : I I I I 
Db 257 RHNITMCPLCDKTCSYWKMSSACATARASHLFDNPATVFFSVFMALWAATFMEHWKRKQM 316 

Qy 291 TLAYRWDCSDYEDTEERPRPQFAA SAPMTAPNPITGEDEPYFPERSRARRMLAGS 345 

I I I I I : : I : I : I I : : I I : I I I : II I 

Db 317 RLNYRWDLTGFEEEEDHPRAEYEARVLEKSLKKESRNKET — DKVKLTWRDRFPAYLTNL 3 74 

Qy 346 WIWMVAVWMCLVSIILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFILILS 405 

I I : I : I I : : : I : I I II : : : : : : : I : : I I I I : : I 

Db 375 VSIIFMIAVTFAIVLGVIIYRISMAAALAMNSSPSVRSNIRVTVTATAVIINLWIILLD 434 

Qy 406 KIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYH 465 

: : I : I I I : I : : I : I I : I I : : I I I I : I : I I I I I I I I I I I : I 

Db 435 EVYGCIARWLTKIEVPKTEKSFEERLIFKAFLLKFVNSYTPIFYVAFFKGRFVGRPGDYV 494 

Qy 466 TLF-GVRNEECAAGGCLIELAQELLVIMVGKQVI-NNMQEVLIPKLKGWWQKFRLRSKKR 523 

: I I I I I I I I I I : I I : I : I I : I I I : I I I : I : I I I : I : : I : : 
Db 495 YIFRSFRMEECAPGGCLMELCIQLSI IMLGKQLIQNNLFEIGIPKMKKLIRYLKLKQQSP 554 

Qy 524 KAGASAGASQGPWEDDYELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVE 583 

: : I II I I II I I : I I : : I I I I I I : I I I : I I I I I I I I I I I : I 
Db 555 PDHEECVKRKQRYEVDYNLEPFAGLTPEYMEMIIQFGFVTLFVASFPLAPLFALLNNIIE 614 

Qy 584 IRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRA — YYR 641 

11111:111 I I I I I I I I : I I I I I :: I I I : I I I I : I I : : : I : I I I : I I I 
Db 615 IRLDAKKFVTELRRPVAVRAKDIGIWYNILRGIGKLAVI IDAFVISFTSDFIPRLVYLYM 674 

Qy 6 42 WTRAHDLRGFLNFTLARAPSSF AAAHN RTCRYRAFRD DDGH 6 82 

: : : : I I : I I I III II : I I I : : I : : 

Db 6 75 YSKNGTMHGFVNHTL SSFNVSDFQNGTAPNDPLDLGYEVQICRYKDYREPPWSENK 730 

Qy 683 Y — SQTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYLA 734 

I I : : I : I I I I I I I I I I : : : I : : I : : I I I I : : : : : I I 
Db 731 YDISKDFWAVLAARLAFVIVFQNLVMFMSDFVDWVIPDIPKDISQQIHKEKVLMVELFMR 790 
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Qy 735 KQALAENEVLFGTNGTKDEQP KGSELSSH 763 

I I I I I I I I I I I I 

Db 791 EEQDKQQLL — ETCMEKERQKDEPPCNHHNTKACPDSLGSPAPSH 833 



RESULT 11 
US-11-177-894-7 

; Sequence 7, Application US/11177894 
; Publication No. US20060040292A1 
; GENERAL INFORMATION: 
; APPLICANT: West, et al . 

; TITLE OF INVENTION: Tumor Markers and Uses Thereof 

; FILE REFERENCE: 2002850-0048 

; CURRENT APPLICATION NUMBER: US/11/177,894 

; CURRENT FILING DATE: 2005-07-08 

; NUMBER OF SEQ ID NOS : 29 

; SOFTWARE: Patentln version 3.2 

; SEQ ID NO 7 

; LENGTH: 960 

; TYPE: PRT 

; ORGANISM: Artificial 

; FEATURE : 

; OTHER INFORMATION: Transmembrane protein 
US-11-177-894-7 

Query Match 35.3%; Score 1456.5; DB 6; Length 960; 

Best Local Similarity 40.2%; Pred. No. 1.8e-134; 

Matches 333; Conservative 147; Mismatches 260; Indels 89; Gaps 22; 

Qy 5 QDGNTTVH YALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLL 61 

: I : I : I : : I I I I I I I I : I I : I : : : : I : I I I I : I I 

Db 144 RDEDTKIHGVGFVKIHAPWNVLCREAEFLKLKMPTKKMYH — INETRGLLK — KINSVLQ 199 

Qy 62 EWPDVPPEYYSCR FRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTP 111 

: : : I : I I I I I I : I : I I I I I : : I I I : I 

Db 200 KITDPIQPKVAEHRPQTMKRLSYPFSREKQHLFDLSD-KDSFFDSKTRSTIVYEILKRTT 258 

Qy 112 YGHEKKNLLGIHQLLAE GVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWN 171 

I : : I I I I I I I : I I : I I I I I : : I I : : I : : I I I : I : 

Db 259 CTKAKYS-MGITSLLANGVYAAAYPLHDGDY NGENVEFNDRKLLYEEWARYGVFY 312 

Qy 172 KYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGS 231 

I I I I : I I I : I I I I I : I I I I I I I II I : I I : : I I : I I I II : : I I : I : I 
Db 313 KYQPIDLVRKYFGEKIGLYFAWLGVYTQMLIPASIVGI IVFLYGCATMDENIPSMEMCDQ 372 

Qy 232 KDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSA 290 

: : I I I I I I : I : I I I I I I : I III: I I I I I : I I I I I I : I : I I I I 
Db 3 73 RHNITMCPLCDKTCSYWKMSSACATARASHLFDNPATVFFSVFMALWAATFMEHWKRKQM 432 

Qy 291 TLAYRWDCSDYEDTEE RPRPQFAA SAPMTAPNPITGEDEPYFPERSRARRM 341 

I I I I I : : I : I I I I : : I I : I I I : II 

Db 433 RLNYRWDLTGFEEEEEAVKDHPRAEYEARVLEKSLKKESRNKET — DKVKLTWRDRFPAY 490 
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342 LAGSWIWMVAWVMCLVSIILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFI 401 

I I I : I : I I : : : I : I I II : : : : : : : | : : I I I I 

491 LTNLVS I IFMIAVTFAIVLGVI I YRISMAAALAMNSSPSVRSNIRVTVTATAVI INLWI 550 

4 02 LILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYP 461 

: : I : : I : I I I : I : : I : I I : I I : : I I I I : I : I I I I I I I I I I 

551 ILLDEVYGCIARWLTKIEVPKTEKSFEERLIFKAFLLKFVNSYTPIFYVAFFKGRFVGRP 610 

462 GNYHTLF-GVRNEECAAGGCLIELAQELLVIMVGKQVI-NNMQEVLIPKLKGWWQKFRLR 519 

1:1 : I 111111111:11 : I : I I : I I I : I I I : I : I I I : I : : I : 
611 GDYVYIFRSFRMEECAPGGCLMELCIQLSIIMLGKQLIQNNLFEIGIPKMKKLIRYLKLK 6 70 

52 0 SKKRKAGASAGASQGPWEDDYELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLN 5 79 

: : : I I I I I II I I : I I : : I I I I I I : I I I : I I I I I I I I I I 

6 71 QQSPPDHEECVKRKQRYEVDYNLEPFAGLTPEYMEMI IQFGFVTLFVASFPLAPLFALLN 730 

580 NWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRA- 638 

I : I I I I I I : I I I I I I I I I I I : I I I I I : : I I I : I I I I I I I : : : I : I I I : I I 
731 N 1 1 E I RLDAKKF VTE LRRPVAVRAKD I G I WYNI LRG I GKLAVI INAFVI SFTSDF IPRLV 7 90 

639 -YYRWTRAHDLRGFLNFTLARAPSSF AAAHN RTCRYRAFRD 6 78 

I : : : : I I : I I I III II : I I I : : I : 

791 YLYMYSKNGTMHGFVNHTL SSFNVSDFQNGTAPNDPLDLGYEVQICRYKDYREPPW 846 

6 79 DDGHY — SQTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYLA — 734 

: I | : : | : | | | | | | | | | | : : : | : : | : : | | | | : : : : : | | 
84 7 SENKYDISKDFWAVLAARLAFVIVFQNLVMFMSDFVDWVIPDIPKDISQQIHKEKVLMVE 906 

735 KQALAENEVLFGTNGTKDEQP KGSELSSH 763 

I I I I I I I I I I I I 

90 7 LFMREEQDKQQLL — ETWMEKERQKDEPPCNHHNTKACPDSLGSPAPSH 953 



RESULT 12 

US-1 1-4 43-42 8A-7 74192 

; Sequence 774192, Application US/11443428A 

; Publication No. US20070083334A1 

; GENERAL INFORMATION: 

; APPLICANT: Mintz, Liat 

; APPLICANT: Xie, Hanqing 

; APPLICANT: Dahari, Dvir 

; APPLICANT: Levanon, Erez 

; APPLICANT: Freilich, Shiri 

; APPLICANT: Beck, Nili 

; APPLICANT: Zhu, Wei-Yong 

; APPLICANT: Wasserman, Alon 

; APPLICANT: Hermesh, Chen 

; APPLICANT: Azar, Idit 

; APPLICANT: Bernstein, Jeanne 

; TITLE OF INVENTION: METHODS AND SYSTEMS USEFUL FOR ANNOTATING BIOMOLECULAR SEQUENCES 

; FILE REFERENCE: 02/23929 

; CURRENT APPLICATION NUMBER: US/11/443, 428A 

; CURRENT FILING DATE: 2006-05-31 

; NUMBER OF SEQ ID NOS : 1034312 
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; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 774192 
; LENGTH: 999 
; TYPE : PRT 

; ORGANISM: Homo sapiens 
US-11-443-428A-7 74192 



Query Match 34.9%; Score 1437; DB 6; Length 999; 

Best Local Similarity 38.7%; Pred. No. 1.6e-132; 

Matches 326; Conservative 161; Mismatches 263; Indels 92; Gaps 23; 

Qy 1 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQ ASNWSAGLLAW 53 

: : I : : : : : : I I I I I I I : : I : I : : : I : I I I 
Db 170 EKDLENKSQGSIFVRIHAPWQVLAREAEFLKIKVPTKKEMYEIKAGGSIAKKFSAAL 226 

Qy 54 LGIPNVLLEWPDVPPEYYSCRFRVNKLP RFLGSDNQDTFFTSTKRHQILFEIL 107 

: : | | | | : : : : | : I I I I : I : I : I I I 

Db 227 QKLSSHLQPRV-PEHSNNKMKNLSYPFSREKMYLYNIQEKDTFFDNATRSRIVHEIL 282 



Qy 108 AKTPYGHEKKNLLGIHQLLAE GVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARW 16 7 

: I | : | | : | : | : | : | | | | | : : | : : | | : : | : | | | | : 

Db 283 KRTACS-RANNTMGINSLIANNIYEAAYPLHDGEYDSPEDD MNDRKLLYQEWARY 336 



Qy 16 8 GKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQE 22 7 



Db 337 GVFYKFQPIDLIRKYFGEKIGLYFAWLGLYTSFLIPSSVIGVIVFLYGCATIEEDIPSRE 396 



Qy 228 LCGSKDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWK 286 

: I : : : I I I I I I I : I I I I I I III III: I I I I I : I I I I I I : I I I I 
Db 39 7 MCDQQNAFTMCPLCDKSCDYWNLSSACGTAQASHLFDNPATVFFSIFMALWATMFLENWK 456 



Qy 287 RKSATLAYRWDCSDYEDTEER PRPQFAA S APMTAPNP I T G 326 

I I I I I : I : I I I II:: II I 

Db 457 RLQMRLGYFWDLTGIEEEEERAQEHSRPEYETKVREKMLKESNQSAVQKLETNTTECGDE 516 



Qy 327 EDEPYFPERSRARRMLAGSWIWMVAVWMCLVSIILYRAIMAIWSRSGNTLLAAWAS 386 

: I I : I I I : I : I : : : I : I I I : I III 
Db 517 DDEDKLTWKDRFPGYLMNFASILFMIALTFSIVF GVIVYRITTAAALS LNKATRS 571 



Qy 387 RI ASLTGSWNLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNF 443 

: : I : : I I I I I I I : I I : : I I I : I : : I : I I : I I I :: I I I 

Db 572 NVRVTVTATAVI INLWILILDEIYGAVAKWLTKIEVPKTEQTFEERLILKAFLLKFVNA 631 



Qy 444 YSSPVYIAFFKGRFVGYPGNYHTLF-GVRNEECAAGGCLIELAQELLVIMVGKQVI-NNM 501 

II 1:111111111 I I : I : I I I I I I I Nihil : I : I I : I I I : I I I : 
Db 632 YSPIFYVAFFKGRFVGRPGSYVYVFDGYRMEECAPGGCLMELCIQLSI IMLGKQLIQNNI 691 



Qy 502 QEVLIPKLKGWWQKFRLRSKKRKAGASAGA-SQGP — WEDDYELVPCEGLFDEYLEMVLQ 558 

I : : I I I I : I I : : I I : I I : I I : I I I I II I I : I I : : I 

Db 692 FEIGVPKLK KLFRKLKDETEAGETDSAHSKHPEQWDLDYSLEPYTGLTPEYMEMI IQ 748 



Qy 559 FGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTH 618 

11111:111: 1111:111111 : I : I I I I : I I I I III I I : I I I I I I I I : I : 
Db 749 FGFVTLFVASFPLAPVFALLNNVIEVRLDAKKFVTELRRPDAVRTKDIGIWFDILSGIGK 808 
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Qy 



619 LAV I S N AF L L AF S S D F L P RA Y YRWTRAHD — LRGFLNFTLA RAPSSFAA 665 



Db 



809 FSVISNAFVIAITSDFIPRLVYQYSYSHNGTLHGFVNHTLSFFNVSQLKEGTQPENSQFD 86 8 



Qy 



666 AHNRTCRYRAFRD DDGHYSQTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIP 720 



Db 



86 9 QEVQFCRFKDYREPPWAPNPYEFSKQYWFILSARLAFVI IFQNLVMFLSVLVDWMIPDIP 92 8 



Qy 



Db 



721 ESVEIKVKRE YYLAKQALAENEVLFGTNGTKDEQPKGSELSSHWTPFTVPKA-S 773 

: : : I : I : : I : I : I I : I I : I : I I 

92 9 TDISDQIKKEKSLLVDFFLKE EHEKLKLMDEPALRSPGGGDRSRSRAASSAPSGQS 984 



Qy 



774 QL 775 



Db 



985 QL 986 



RESULT 13 

US-11-443-428A-739452 

; Sequence 739452, Application US/11443428A 

; Publication No. US20070083334A1 

; GENERAL INFORMATION: 

; APPLICANT: Mintz, Liat 

; APPLICANT: Xie, Hanqing 

; APPLICANT: Dahari, Dvir 

; APPLICANT: Levanon, Erez 

; APPLICANT: Freilich, Shiri 

; APPLICANT: Beck, Nili 

; APPLICANT: Zhu, Wei-Yong 

; APPLICANT: Wasserman, Alon 

; APPLICANT: Hermesh, Chen 

; APPLICANT: Azar, Idit 

; APPLICANT: Bernstein, Jeanne 

; TITLE OF INVENTION: METHODS AND SYSTEMS USEFUL FOR ANNOTATING BIOMOLECULAR SEQUENCES 

; FILE REFERENCE: 02/23929 

; CURRENT APPLICATION NUMBER: US / 1 1 / 4 43 , 428A 

; CURRENT FILING DATE: 2006-05-31 

; NUMBER OF SEQ ID NOS : 1034312 

; SOFTWARE: Patentln version 3.1 

; SEQ ID NO 739452 

; LENGTH: 800 

; TYPE : PRT 

; ORGANISM: Homo sapiens 
US-11-4 43-42 8A-739452 

Query Match 34.4%; Score 1417; DB 6; Length 800; 

Best Local Similarity 40.3%; Pred. No. 1.2e-130; 

Matches 326; Conservative 142; Mismatches 252; Indels 88; Gaps 22; 

Qy 23 LCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLLEWPDVPPEYYSCR 75 

II I I : I I : I : : : : I : I I I I : I I : : : I : I 

Db 7 LC — ARVLKLKMPTKKMYH — INETRGLLK — KINSVLQKITDPIQPKVAEHRPQTMKRL 6 0 
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76 FRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLLGIHQLLAE GVLS 132 

I I I I I : I : I I I I I : : I I I : I I : : I I Mill: 

61 SYPFSREKQHLFDLSD-KDSFFDSKTRSTIVYEILKRTTCTKAKYS-MGITSLLANGVYA 118 

133 AAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFA 192 

11:11111 : : I I : : I : : III: : I I I I : I I I : I I I I I : I I I I 

119 AAYPLHDGDY NGENVEFNDRKLLYEEWARYGVFYKYQPIDLVRKYFGEKIGLYFA 173 

193 WLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPLC-LDCPFWLLSS 251 

I I I I I I : I I : : I I : I I I I I : : I I : I : I : : I I I I I I : I : I I 
174 WLGVYTQMLIPASIVGI IVFLYGCATMDENIPSMEMCDQRHNITMCPLCDKTCSYWKMSS 233 

252 ACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSDYEDTEE R 30 7 

I I I I : I III: I I I I I : I I I I I I : I : I I I I I I I I I : : I : I I 
234 ACATARASHLFDNPATVFF SVFMALWAATFMEHWKRKQMRLNYRWDLTGFEEEEEAVKDH 2 93 

308 PRPQFAA SAPMTAPNPITGEDEPYFPERSRARRMLAGSWIWMVAVWMCLVSI 362 

I I : : I I : | | | : I I I I I : I : I I : : : 

2 94 PRAEYEARVLEKSLKKESRNKET — DKVKLTWRDRFPAYLTNLVS I IFMIAVTFAIVLGV 351 

363 ILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEMHR 422 

I : I I II : : : : : : : I : : I I I I : : I : : I : I I I : I : : 

352 1 1 Y R I S MAAAL AMN S S P S VR S N I RVT VT AT AV I INLWI ILLDEVYGCIARWLTKIEVPK 411 

423 TQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLF-GVRNEECAAGGCL 481 

I : II: | | : : | | | | : I : I I I I I I I I I I I : I : I I I I I I I I I I 

412 TEKSFEERLIFKAFLLKFVNSYTPIFYVAFFKGRFVGRPGDYVYIFRSFRMEECAPGGCL 4 71 

4 82 IELAQELLVIMVGKQVI-NNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDY 54 0 
: I I : I : I I : I I I : I I I : I : I I I : I : : I : : : : I I I 

4 72 MELCIQLSIIMLGKQLIQNNLFEIGIPKMKKLIRYLKLKQQSPPDHEECVKRKQRYEVDY 531 

5 41 ELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVE IRLDARKFVCEYRRPVA 6 00 

I I II I I : I I : : I I I I I I : I I I : I I I I I I I I I I I : I I I I I I : I I I I I I I I I 
532 NLEPFAGLTPEYMEMI IQFGFVTLFVASFPLAPLFALLNNI IEIRLDAKKFVTELRRPVA 591 

6 01 ERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRA — YYRWTRAHDLRGFLNFTLAR 658 

I I : I I I I I :: I I I : I I I I I I I : : : I : I I I : I I I : : : : I I : I II 

5 92 VRAKDIGIWYNILRGIGKLAVI INAFVISFTSDFIPRLVYLYMYSKNGTMHGFVNHTL — 6 49 

659 APSSF AAAHN RTCRYRAFRD DDGHY — SQTYWNLLAIRLAF 697 

III II : | | | : : | : : | I : : I : I I I I I I 

650 — SSFNVSDFQNGTAPNDPLDLGYEVQICRYKDYREPPWSENKYDISKDFWAVLAARLAF 70 7 

6 98 VIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYLA KQALAENEVLFGT 74 7 

I I I I : : : I : : I : : I I I I : : : : : I I III I 

708 VIVFQNLVMFMSDFVDWVIPDIPKDISQQIHKEKVLMVELFMREEQDKQQLL — ETWMEK 765 

748 NGTKDEQP KGSELSSH 763 

I I I I II II 

766 ERQKDEPPCNHHNTKACPDSLGSPAPSH 793 



RESULT 14 
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US-11-4 43-42 8A-73945 4 

Sequence 739454, Application US/11443428A 
Publication No. US20070083334A1 
GENERAL INFORMATION: 

APPLICANT: Mintz, Liat 

APPLICANT: Xie, Hanqing 

APPLICANT: Dahari, Dvir 

APPLICANT: Levanon, Erez 

APPLICANT: Freilich, Shiri 

APPLICANT: Beck, Nili 

APPLICANT: Zhu, Wei-Yong 

APPLICANT: Wasserman, Alon 

APPLICANT: Hermesh, Chen 

APPLICANT: Azar, Idit 

APPLICANT: Bernstein, Jeanne 

TITLE OF INVENTION: METHODS AND SYSTEMS USEFUL FOR ANNOTATING BIOMOLECULAR SEQUENCES 
FILE REFERENCE: 02/23929 

CURRENT APPLICATION NUMBER: US / 1 1 / 4 43 , 4 2 8A 
CURRENT FILING DATE: 2006-05-31 
NUMBER OF SEQ ID NOS : 1034312 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 739454 

LENGTH: 800 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-11-443-428A-739454 

Query Match 34.4%; Score 1417; DB 6; Length 800; 

Best Local Similarity 40.3%; Pred. No. 1.2e-130; 

Matches 326; Conservative 142; Mismatches 252; Indels 88; Gaps 22; 

Qy 23 LCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLLEWPDVPPEYYSCR 75 

II I I : I I : I : : : : I : I I I I : I I : : : I : I 

Db 7 LC — ARVLKLKMPTKKMYH — INETRGLLK — KINSVLQKITDPIQPKVAEHRPQTMKRL 6 0 

Qy 76 FRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLLGIHQLLAE GVLS 132 

I I I I I : I : I I I I 1 : : I I I : I I : : I I Mill: 

Db 61 SYPFSREKQHLFDLSD-KDSFFDSKTRSTIVYEILKRTTCTKAKYS-MGITSLLANGVYA 118 

Qy 133 AAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFA 192 

11:11111 : : I I : : I : : III: : I I I I : I I I : I I I I I : I I I I 

Db 119 AAYPLHDGDY NGENVEFNDRKLLYEEWARYGVFYKYQPIDLVRKYFGEKIGLYFA 173 

Qy 193 WLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPLC-LDCPFWLLSS 251 

I I I I I I : I I : : I I : I I I I I : : I I : I : I : : I I I I I I : I : I I 
Db 174 WLGVYTQMLIPASIVGI IVFLYGCATMDENIPSMEMCDQRHNITMCPLCDKTCSYWKMSS 233 

Qy 252 ACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSDYEDTEE R 307 

I I I I : I III: 11111:111111 : I : I I I I I I I I I : : I : I I 
Db 234 ACATARASHLFDNPATVFFSVFMALWAATFMEHWKRKQMRLNYRWDLTGFEEEEEAVKDH 293 

Qy 308 PRPQFAA SAPMTAPNPITGEDEPYFPERSRARRMLAGSWIWMVAWVMCLVSI 362 

I I : : I I : | | | : I I I I I : I : I I : : : 

Db 294 PRAEYEARVLEKSLKKESRNKET — DKVKLTWRDRFPAYLTNLVSI IFMIAVTFAIVLGV 351 
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Qy 363 ILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEMHR 422 

I : I I II : : : : : : : I : : I I I I : : I : : I : I I I : I : : 

Db 352 IIYRISMAAALAMNSSPSVRSNIRVTVTATAVIINLWIILLDEVYGCIARWLTKIEVPK 411 

Qy 423 TQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLF-GVRNEECAAGGCL 481 

I : II: | | : : | | | | : I : I I I I I I I I I I I : I : I I I I I I I I I I 

Db 412 TEKSFEERLIFKAFLLKFVNSYTPIFYVAFFKGRFVGRPGDYVYIFRSFRMEECAPGGCL 471 

Qy 482 IELAQELLVIMVGKQVI-NNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDY 540 

: I I : I : I I : I I I : I I I : I : I I I : I : : I : : : : I I I 

Db 472 MELCIQLSI IMLGKQLIQNNLFEIGIPKMKKLIRYLKLKQQSPPDHEECVKRKQRYEVDY 531 

Qy 541 ELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVA 600 

I I II I I : I I : : I I I I I I : I I I : I I I I I I I I I I I : I I I I I I : I I I I I I I I I 
Db 532 NLEPFAGLTPEYMEMI IQFGFVTLFVASFPLAPLFALLNNI IEIRLDAKKFVTELRRPVA 591 

Qy 601 ERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRA — YYRWTRAHDLRGFLNFTLAR 658 

I I : I I I I I :: I I I : I I I I I I I : : : I : I I I : I I I : : : : I I : I II 

Db 592 VRAKDIGIWYNILRGIGKLAVI INAFVISFTSDFIPRLVYLYMYSKNGTMHGFVNHTL — 649 

Qy 659 APSSF AAAHN RTCRYRAFRD DDGHY — SQTYWNLLAIRLAF 697 

III II : I I I : : I : : | I : : I : I I I I I I 

Db 650 — SSFNVSDFQNGTAPNDPLDLGYEVQICRYKDYREPPWSENKYDISKDFWAVLAARLAF 707 

Qy 6 98 VIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYLA KQALAENEVLFGT 74 7 

I I I I : : : I : : I : : I I I I : : : : : I I III I 

Db 708 VIVFQNLVMFMSDFVDWVIPDIPKDISQQIHKEKVLMVELFMREEQDKQQLL — ETWMEK 765 

Qy 748 NGTKDEQP KGSELSSH 763 

I I I I II II 

Db 766 ERQKDEPPCNHHNTKACPDSLGSPAPSH 793 



RESULT 15 

US-11-4 43-42 8A-739456 

; Sequence 739456, Application US/11443428A 

; Publication No. US20070083334A1 

; GENERAL INFORMATION: 

; APPLICANT: Mintz, Liat 

; APPLICANT: Xie, Hanqing 

; APPLICANT: Dahari, Dvir 

; APPLICANT: Levanon, Erez 

; APPLICANT: Freilich, Shiri 

; APPLICANT: Beck, Nili 

; APPLICANT: Zhu, Wei-Yong 

; APPLICANT: Wasserman, Alon 

; APPLICANT: Hermesh, Chen 

; APPLICANT: Azar, Idit 

; APPLICANT: Bernstein, Jeanne 

; TITLE OF INVENTION: METHODS AND SYSTEMS USEFUL FOR ANNOTATING BIOMOLECULAR SEQUENCES 

; FILE REFERENCE: 02/23929 

; CURRENT APPLICATION NUMBER: US/11/443, 428A 

; CURRENT FILING DATE: 2006-05-31 
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NUMBER OF SEQ ID NOS : 1034312 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 739456 
LENGTH: 853 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-11-443-428A-739456 

Query Match 34.3%; Score 1416; DB 6; Length 853; 

Best Local Similarity 40.9%; Pred. No. 1.6e-130; 

Matches 322; Conservative 142; Mismatches 248; Indels 76; Gaps 21; 

Qy 23 LCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLLEWPDVPPEYYSCR 75 

II I I : I I : I : : : : I : I I I I : I I : : : I : I 

Db 7 LC — ARVLKLKMPTKKMYH — INETRGLLK — KINSVLQKITDPIQPKVAEHRPQTMKRL 6 0 

Qy 76 FRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLLGIHQLLAE GVLS 132 

I I I I I : I : I I I I i : : I I I : I I : : I I Mill: 

Db 61 SYPFSREKQHLFDLSD-KDSFFDSKTRSTIVYEILKRTTCTKAKYS-MGITSLLANGVYA 118 

Qy 133 AAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFA 192 

I I : I I I I I : : I I : : I : : MM : II II M MM: I I I I 

Db 119 AAYPLHDGDY NGENVEFNDRKLLYEEWARYGVFYKYQPIDLVRKYFGEKIGLYFA 173 

Qy 193 WLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPLC-LDCPFWLLSS 251 

I I I I I I M I : M I M I I I I : M M I M : : I I I I I I M M I 
Db 174 WLGVYTQMLIPASIVGIIVFLYGCATMDENIPSMEMCDQRHNITMCPLCDKTCSYWKMSS 233 

Qy 252 ACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSDYEDTEE R 307 

I I I I : I MM II II I M II II I M M II I I I I I I : M : I I 

Db 234 ACATARASHLFDNPATVFFSVFMALWAATFMEHWKRKQMRLNYRWDLTGFEEEEEAVKDH 293 

Qy 308 PRPQFAA SAPMTAPNPITGEDEPYFPERSRARRMLAGSWIWMVAWVMCLVSI 362 

II : : I I : | | | : I I I I I = 1 = 11 : : : 

Db 294 PRAEYEARVLEKSLKKESRNKET — DKVKLTWRDRFPAYLTNLVSI IFMIAVTFAIVLGV 351 

Qy 363 ILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEMHR 422 

MM II : : : : : : : I : : II I I : : I : : I : I I I : I : : 

Db 352 I IYRISMAAALAMNSSPSVRSNIRVTVTATAVIINLVVI ILLDEVYGCIARWLTKIEVPK 411 

Qy 423 TQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLF-GVRNEECAAGGCL 481 

I : IM | | :: || | |: M I I I I I I I I I I M I : I I I I I I I I I I 

Db 412 TEKSFEERLIFKAFLLKFVNSYTPIFYVAFFKGRFVGRPGDYVYIFRSFRMEECAPGGCL 471 

Qy 482 IELAQELLVIMVGKQVI-NNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDY 540 

Ml : I : I I : I I I : I I I : |: II I : I : : I : : : MM 

Db 472 MELCIQLSI IMLGKQLIQNNLFEIGIPKMKKLIRYLKLKQQSPPDHEECVKRKQRYEVDY 531 

Qy 541 ELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVA 600 

I I II I I : I I : : I I I I I I : II I : I I I I I I I I I I I : I I I I I I : I I I I I I I I I 
Db 532 NLEPFAGLTPEYMEMI IQFGFVTLFVASFPLAPLFALLNNI IEIRLDAKKFVTELRRPVA 591 

Qy 601 ERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRA — YYRWTRAHDLRGFLNFTLAR 658 

I I : I I I I I :: I I M I I I I I I I : : : i : I I M I I I : : : : MM II 
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Db 592 VRAKDIGIWYNILRGIGKLAVI INAFVISFTSDFIPRLVYLYMYSKNGTMHGFVNHTL — 649 

Qy 659 APSSF AAAHN RTCRYRAFRD DDGHY — SQTYWNLLAIRLAF 697 

III II : I I I : : I : : | I : : I : I I I I I I 

Db 650 — SSFNVSDFQNGTAPNDPLDLGYEVQICRYKDYREPPWSENKYDISKDFWAVLAARLAF 707 

Qy 6 98 VIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYLA KQALAENEVLFGT 74 7 

I I I I : : : I : : I : : I I I I : : : : : I I III I 

Db 708 VIVFQNLVMFMSDFVDWVIPDIPKDISQQIHKEKVLMVELFMREEQDKQQLL — ETWMEK 765 

Qy 748 NGTKDEQP 755 

III I 

Db 766 ERQKDEPP 773 



Search completed: June 24, 2008, 15:37:43 
Job time : 350 sees 
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